AVERAGE PRIME-PAIR COUNTING FORMULA 



JAAP KOREVAAR AND HERMAN TE RIELE 



Abstract. Taking r > 0, let 7T2 r (x) denote the number of prime pairs 
(p, p + 2r) with p < x. The prime-pair conjecture of Hardy and Little- 
wood (1923) asserts that 7T2 r (x) ~ 2C2 r li2(x) with an explicit constant 
C2r > 0. There seems to be no good conjecture for the remainders 
ui2r{x) = TT2r(x) — 2C2 r li2 (%) that corresponds to Riemann's formula for 
7r(x) — li(x). However, there is a heuristic approximate formula for aver- 
ages of the remainders U2r(x) which is supported by numerical results. 



1. Introduction 

For r e N, let 7T 2r (x) denote the number of prime pairs (p, p + 2r) with 
p < x. The famous prime-pair conjecture (PPC) of Hardy and Littlewood 
H2l asserts that for x — > oo, 

log 2 X 



r dt 

(1.1) 7r 2r (x) ~ 2C 2r li 2 (x) = 2C 2r / — ^ ~ 2C 2r -^ 

J2 log t 1 

Here C 2 is the 'twin-prime constant', 

(1.2) C 2 = Yl jl- _j I w 0.6601618158, 

p prime, p>2 * ^ 

and the general 'prime-pair constant' C 2r is given by 

(1.3) c 2r =c 2 n Pi- 

p prime, p|r,p>2 

Assuming that the PPC is true, let w 2r (x) denote the remainder 

(1.4) oj 2r (x) = rt 2r (x) - 2C 2r li 2 (a;). 
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We have not been able to find a good approximation of the remainders 
W2 r (x) that correspond to Riemann's approximate formula for tt(x) — li(x) 
(see ( 11.91) below). Instead, by complex analysis and heuristic arguments we 
obtain the following plausible approximation for averages (1/iV) Yl r =i u 2r( x ) 
with large N, and we support the formula by extensive numerical results. 

Approximation 1.1. For N > 1 and x > N 2+s , with < 8 < 1, one has 

1 - 

M x ) - 2C 2r li 2 (x)} = -{4 + 0(N- 1 ' 2 log*)} pH^ p ) 

r=l p 

(1.5) - {1 + 0{N~ 1 ' 2 loga;)}li 2 (a; 1/2 ) + 0{x 1 '^), 

with a symmetric sum over the complex zeros p of ((s). 
To test this conjectured approximation we observe that 

E./.B.M fi a (i \\ T(x) + (i , where 



li^x 1 / 2 ) [4 \logx / J \logx 

(1.6) T(x) = Y 

p 

Neglecting the C-terms in fll.5ft and (11. 6ft . dividing by li^x 1 / 2 ), and adding 
T(x) + 1, we obtain the error function 

We have evaluated and plotted this function for fixed x = 10 6 , 10 8 , 10 10 , 10 12 
and 2 < 2N < 5000 (Figures 1-4 in Section 7), and for fixed N = 400, 2500 
and 6 < log 10 x < 12 (Figures 5-6 in Section 7). Taking into account the 
(9-terms in ( ll.5p and (11.61) . A^(x) should have the form 

0(iV- 1/2 (logx) + l/logx)T(x) + 0(iV- 1/2 (logx) + 1/logx) + 0(x- s/7 ). 

Our four plots for fixed x, and the two for fixed N, show that Approximation 
11.11 is good for large N, provided x/N 2 is large. 

When x is comparable to N 2 , the theory predicts sizeable deviation, 
roughly 

fl Q\ A f \ ~ A" f \ dcf 2iVlog 2 X 

(1.8) ^''"^''' l^giff i 

see Section [5j Behavior of this type is seen in the plots for x = 10 6 and 10 8 . 
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In connection with (JT3J) we recall Riemann's approximation for the re- 
mainder 7r(x) — \i(x). If Rep = 1/2 for all p one has 

(1.9) u{x) = ir{x) - li(x) = -J2 KOO - (V2) \i{x 1/2 ) + 0(x b ), 

p 

where b may be any number greater than 1/3. This follows from von Man- 
goldt's formula for ip(x) = J2 n<x A(n). Here A(n) denotes his function: 
A(n) = logp if n = p a with p prime, and A(n) = if n is not a prime 
power. Von Mangoldt proved that 

(1-10) ^(x)=X-> ^-r^+> — ^, 

1 ; y ' ^ p C(o) Y 2A; 

and this is exact for all x > 1 where ip(x) is continuous; cf. Davenport [8J, 
Edwards [9J, Ivic [H]. 

PART I. HEURISTICS 

2. First step towards Conjectured Approximation 11.11 
Let us start by introducing the functions 

4>2r(x) = A(n)A(n + 2r), M*) = log 2 p, 

n<x p,p+2r prime; p<£ 

(2.1) ^(x)= £ log 2 p. 

p, p 2 ±2r prime; p<x 

Partial summation or integration by parts shows that the PPC ( 11.11) is 
equivalent to each of the asymptotic relations 

(2.2) 2r (x) ~ 2C 2r x, ip2r(x) ~ 2C*2 r x as x — > oo. 

We have counted the prime pairs (p, p + 2r) with 2r < 5 • 10 3 and p < 
x = 10 3 , 10 4 , • • • , 10 12 . Table 1 is based on this work; cf. also a table in 
Granville and Martin [TT] and one by Fokko van de Bult \7\. The bottom 
line shows (rounded) values L 2 (x) of the comparison function 2C2li2(^) for 
ti 2 {x). Computations based on these prime-pair counts make it plausible 
that for every r e N and every e > 0, 

(2.3) u 2r (x) = rt 2r (x) - 2C 2r li 2 (x) = 0(x^ +£ ). 
Equivalently, there would be inequalities 

(2.4) n 2r (x) = ip 2r {x) - 2C 2r x = 0(x {1/2)+£ '), 
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2r\x 


10 3 


10 4 


10 6 


10 8 


10 10 


10 12 


C 2r / C; 


2 


35 


205 


8169 


440312 


27412679 


1870585220 


1 


4 


41 


203 


8144 


440258 


27409999 


1870585459 


1 


6 


74 


411 


16386 


879908 


54818296 


3741217498 


2 


8 


38 


208 


8242 


439908 


27411508 


1870580394 


1 


10 


51 


270 


10934 


586811 


36548839 


2494056601 


4/3 


12 


70 


404 


16378 


880196 


54822710 


3741051790 


2 


14 


48 


245 


9878 


528095 


32891699 


2244614812 


6/5 


16 


39 


200 


8210 


441055 


27414828 


1870557044 


1 


18 


74 


417 


16451 


880444 


54823059 


3741063106 


2 


20 


48 


269 


10972 


586267 


36548155 


2494072774 


4/3 


22 


41 


226 


9171 


489085 


30459489 


2078443752 


10/9 


24 


79 


404 


16343 


880927 


54823858 


3741122743 


2 


30 


99 


536 


21990 


1173934 


73094856 


4988150875 


8/3 


210 


107 


641 


26178 


1409150 


87712009 


5985825351 


16/5 


L 2 (x): 


46 


214 


8248 


440368 


27411417 


1870559867 





Table 1 . 7r 2r (x) for selected values of 2r and x 



which follow from similar estimates for 9 2r (x) — 2C 2r x. 

Our work requires a good estimate for the difference ip 2r (x) — 9 2r {x). The 
main contributions come from the prime pairs (p, p 2 ± 2r) and the sum 

PX 

{logp log(p + 2r) - \og 2 p} = / logHog(l + 2r/t) dir 2r (t). 

p, p+2r prime; p<x ^ 

Taking x > 2r and writing f£ = f£ + ^ ■• one nn( is that uniformly in r, 

ip 2r (x) - 9 2r (x) = 26* 2r (x 1/2 ) + 4rC 2r loglogx 
(2.5) + 0(rC 2r / log 2r) + 0(x 1 '* log 2 x). 



Our first goal will be to motivate the following 
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Approximation 2.1. Taking N large and x much larger than N, one has 

1 N p 
- £ iMx) - 2C 2r x} = -{4 + 0(N- 1 / 2 log*)} £ - 

r=l p P 

(2.6) + 0{N~ 1/2 x 1/2 \ogx) - {1 + o(l)}N. 

In support of this conjectured approximation we will derive a related 
conjecture involving Dirichlet series. For s = a + It with o > 1/2, set 
(2-7) 

= y A(n)A(n + 2r) = [°° = 2s fj-w^ 

Here we use denominator n 2s (and not n s ) because of the function $ A (s) in 
Theorem 13.11 and the corresponding integral in ( 18.2ft . 

By a two-way Wiener-Ikehara theorem for Dirichlet series with positive 
coefficients, the PPC in the form (12.21) is true if and only if the difference 
(2.8) 

q r < poo poo 

G 2r (s) = D 2r (s) - = / x- 2s oin 2r (x) = 2s / x- 2s - l n 2r (x)alx 



2s 

has 'good' boundary behavior as a \ 1/2. That is, G 2r [a + it) should 
tend to a distribution G 2r {{l/2) + ir} which is locally equal to a pseudo- 
function; see Korevaar [TH]. Here, a pseudofunction is the distributional 
Fourier transform of a bounded function which tends to zero at infinity. It 
cannot have poles and is locally given by Fourier series whose coefficients 
tend to zero. In particular D 2r (s) itself would have to show pole- type be- 
havior, with residue C 2r , for angular approach of s to 1/2 from the right; 
there should be no other poles on the line {a = 1/2}. 

In view of the expected estimate (12 .4p it is reasonable to suppose that 
the difference G 2r (s) is actually analytic for a > 1/4. Where would one 
expect the first singularities? Assuming Riemann's Hypothesis (RH), we 
will motivate a conjecture involving averages of functions G 2r (s): 

Conjecture 2.2. For o > 1/4 and N — > oo, one has 

N ^ G2r{3) - (As - If + 4,-1 + ° {N ] ^ W^P? 

r=l v ' p v ' 

(2.9) - {4 + 0(N-^ 2 )} + H N (s), 
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with symmetric sums over zeta's complex zeros p. The remainder H N {s) 
has 'good' boundary behavior as a \ 1/4. For large N its most significant 
part may be a term — {1 + o(l)}N. 

This conjecture motivates Approximation 1241 through formal Fourier in- 
version; cf. (12.81) . If L(c) denotes a 'vertical line' given by a = c > 1/4, 
then 

(2.10) j] 2r ( a: ) = J_ f G 2r (s)x 2s -. 

J l(c) s 



3. The theorem behind Conjecture 12.21 

To arrive at (I2.9p we start with a result for a weighted sum of functions 
D 2r (s); cf. [IE] (where there is a less precise result) and the Appendix. 
The weights are derived from an even 'sieving function' E(u), with E(0) = 
1 and support [—1,1], that can be made to approach 1 on (—1,1). A 
minimal smoothness requirement is that E{p) be absolutely continuous, 
with derivative E'(v) of bounded variation. 

Theorem 3.1. Assume RE. Then for A > and 1/2 < a < 1, 

$ A ( S ) d =Do(s) + 2 Yl E(2r/X)D 2r (s) 

0<2r<\ 

(3.1) = ^_u. AE ^ + tf W+fl * (>) . 

P 

The function Dq(s) is obtained from ( 12.71) by taking r = and the constant 
A E is given by J E[y)dv. The function S A (s) = T, x,E (s) will be described 

below, and the remainder Hq(s) = H Q ' (s) is analytic for < a < 1. For 
large A its most significant part may be a term that behaves like — A 2 /2 when 
E(v) is close to 1 on (—1,1). 

The function Dq(s) can be written as follows: 

(3.2) D (s) 



^ A^n) = 1 d_ f 02£) _ 1 S'(4*) ] 
^ n 2s 2 ds\((2s) 2 ((4s) J U ' 



where H\(s) is analytic for a > 1/6. Hence Dq(s) is meromorphic for 
cr > 1/6. Its poles there are purely quadratic, and located at s — 1/2, 1/4 
and the points p/2. Thus by ( 13. lft , and under assumption (12.41) . the pole 
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of the difference £ A (s) — D Q (s) at s = 1/2 can only be of first order. The 
residue will be given by 

(3.3) R E (l/2, A) = 2 E{2r/X)C 2r - A E X. 

0<2r<\ 

We need the important fact that the constants C 2r have mean value one. 
Stronger results were obtained by Bombieri-Davenport [5] and Montgomery 
[19] . and these were later improved by Friedlander and Goldston [10] to 



(3.4) S m = Y^C 2r = m- (1/2) logm + 0{log 2/3 (m + 1)}. 



r=l 



Partial summation in f)3.3p will thus show that for our sieving functions E, 
the residue R(1/2,X) is o(A) and in fact, O(logA) as A — ► oo. We will see 
in Section H] that the outcome o(A) - or 0(\ e ) - would follow heuristically 
from the way the parameter A occurs in the terms of S A (s). 

The description of S A (s) requires a Mellin transform associated with the 
Fourier transform E x (t) of E(v/X). For z = x + iy with < x < 1 we set 



1 / 

M\z) = - / E\t)t- Z dt 
K Jo 

(3.5) =-\*r(l-z)sm(icz/2) [ E{y)v z - X dz 

^ Jo 

= -X z T(-z - 1) sin(7rz/2) / u z+1 dE'(p). 
K Jo 

The function M x (z) extends to a meromorphic function for x > — 1 with 
simple poles at the points z = 1, 3, • • ■ . The residue of the pole at z = 1 
is -2(\/it)A e with A E = J* E(u)du, and M A (0) = 1. Furthermore, the 
standard order estimates 

(3.6) T(z) < \y\*-V 2 e -*W 2 , sin(vr^/2) < e nlyl/2 
for |x| < C and \y\ > 1 imply the useful majorization 

(3.7) M x (x + iy) < \ x {\y\ + l)^" 3 / 2 for - 1< x < C, \y\ > 1. 

Example 3.2. One may take E{y/X) equal to the Fejer kernel for R: 

1 p sin 2 (At/2) / 1-H/A forH<A, 

E{v \) = — I —. , . - cos utdt = < „ r . , 

7T 7 A(t/2) 2 \ for H > A. 
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In this case one finds 

M x (z) = -X z T(-z - 1) sin(7rz/2). 

7T 

The function S A (s). For any A > 0, the function S A (s) is given by the sum 

{W)} 2 + 2 W) ^ r(p ~ s)mX{p ~ s) cosMp ~ s)/2} 

(3.8) + T (P' - s ) r (/ " s)M\p' + p" - 2s) cos{7r(p' - p")/2}. 
p>,p» 

Here p, p' and p" independently run over the complex zeros of ({s). It is 
convenient to denote the sum of the first two terms by £ A (s); for < a < 1 
it has poles at s = 1 and the points p. For well-behaved functions M x (z), 
the double series defines a function S A (s) as a limit of square partial sums. 
Under RH the double series with our normal M A (z) is absolutely convergent 
for 1/2 < a < 3/2. Indeed, setting p' = (1/2) + zY, p" = (1/2) + ij" and 
s = a + it, the inequalities (13.61) . (13. 7h show that the terms in the double 
series are bounded by 

C(X, rXIVI + 1)- CT (|7"| + + 7"l + 1)- 1+2CT - 3 / 2 . 

Observing that the number of zeros p = (1/2) ± i*y with ra<7<n + lis 
O(logn), the convergence now follows from a discrete analog of the following 
simple lemma; cf. Korevaar [15] . 

Lemma 3.3. For rea/ constants a, b, c, the function 

(f) (y jV ) = (\y\ + l)-«(\v\ + l)- b (\y + v\ + l)- C 

is integrable over M 2 if and only if a + 6 > 1, a + c > 1, 6 + c> 1 and 
a + 6 + c > 2. For integrability over £/ie condition a + b > 1 may fee 
dropped. 

By the lemma, the part of the double sum S A (s) in (I3.8P in which 7' = 
Im p' and 7" = Im p" have the same sign defines a meromorphic function 
for < o < 1 whose only poles occur at the complex zeros of ((■). Thus 
for a study of its pole-type behavior near the point s = 1/2, the sum 
S A (s) in (13.81) may be reduced to the sum Sg(s) in which 7' and 7" have 
opposite sign. Replacing 7" by —7" and using standard asymptotics for the 
Gamma function, it follows that the pole-type behavior of S A (s) and S A (s) 
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as s \ 1 /2 is the same as that of the reduced sum 

(3.9) E A (s) = 2tt (7Y) _s+i(7 '" 7 " )/2 M X {1 -2s + i(i - 7 ")}. 

7'>0,7">0 

Hence in the study of the PPC under RH, the differences of zeta's zeros in, 
say, the upper half-plane play a key role; cf. Montgomery [20] . 

Formally, the poles of £ A (s) at the points s = p cancel each other. Under 
assumption (12 .4ft . the function £ A (s) nas a meromorphic continuation to 
the half-plane {a > 1/4}, and then there will be real cancellation; see (13.11) . 

4. Motivation of Conjecture 12.21 

As we saw, numerical results make it plausible that the functions G2 T {s) 
have an analytic continuation to the half-plane {a > 1/4}. Hence by The- 
orem EH] and (13.31) . assuming RH, the function 

y\s) = 2 J2 E(2r/X)G 2r (s)+AA E \J2^z- 

0<2r<\ p S P 

(4.1) = e\ S ) - Dow - 2RE £!_\ X) + H ° is) 

will also have an analytic continuation to the half-plane {a > 1/4}. This 
shows that the quadratic pole l/(2s — l) 2 of D (s) at s = 1/2 must be 
cancelled by a pole of S A (s) at s = 1/2. By (13.81) the latter pole is the same 
as that of the double sum S A (s), and hence, of S A (s) in (13.91) . 

Computation suggests that the quadratic part of the pole at s = 1/2 in 
S A (s) comes from the terms with 7" = 7'. Indeed, for s = (1/2) + 5 with 
small 5 > 0, and using the counting function N(t) of zeta's complex zeros, 

/oo 
r l - 25 dN(t) 

J_ logA + C?(l) 
45 2 25 

The residue of the pole of S A (s) at s = 1/2 is R E (l/2, A). We know that 
under assumption (12.41) . the residue is 0(\ £ ). Independently of (12.41) . this 
may be deduced heuristically from the fact that A occurs in the terms of the 
series for £ A (s) only as \p' + p"~ 2s i which is 0(X e ) for s « 1/2. 

We now turn to the likely behavior of l I /A (s) in (14. ip near the line L = 
{a = 1/4}. Since D (s) has quadratic poles at the points s — 1/4 and 
p/2, and no other poles on L, cf. (13 .2\\ . we assume that the (meromorphic 
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continuation of the) double sum S^l 5 ) likewise has poles at 1/4 and the 
points p/2, and nowhere else on L. This assumption is plausible because it is 
known to be true for A < 2, when the sum over r in (14. Xp is empty, so that the 
difference (s) — D (s) has no poles on L other than first order poles at the 
points p/2. If the heuristic argument in the preceding paragraph has general 
validity, one expects that the coefficients of the pole terms of S^s) at 1/4 
and the points p/2 are O^X 1 ^ 2 ), or in any case (9(A (1/2)+£ ) for every e > 0. 
Indeed, the terms in S^s) contain A as a factor X p ' +P "~ 2s = 0(X^' 2 ' +£ ) for 
a\l/4. 

Hence by ( 14. ip . taking coefficient bound 0(X 1 ^ 2 ) for simplicity, the sum 
2J2r=i E(r/N)G 2r (s) should behave like 

C(A 1/2 ) 0{X 1 ' 2 ) 



(4 S _1)2 4 S _i 
near the point s = 1/4, and like 

C(A 1/2 ) AA E X + 0(X 1 / 2 ) 



(2s -p) 2 2s -p 

near the points s = p/2. Assuming uniformity here relative to p, and taking 
A = 2N, the singular part of the average 

1 N 

-^E(r/iV)G 2r ( S ) 

r=l 

for a > 1/4 will have the form 

r=l v ' p v ' 

- {4A £ + ^(iV- 1 / 2 )} ^ — L_ + H N (s). 



The remainder H N,E (s) will have good boundary behavior as a \ 1/4 and 
it contains 1/(2N) times the remainder (s) from Theorem 13.11 with 
A = 2N. Now taking E(v) close to the function which is equal to 1 on 
[— 1, 1] and elsewhere, one is led to Conjecture 12.21 

This conjecture, finally, makes the conjectured Approximation 12.11 plau- 
sible through formal Fourier inversion (12.101) . 
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5. From Approximation 12.11 to Approximation 11.11 via 

Approximation 15.21 

After motivating Approximation |2~T1 for averages of functions ip2r{%), we 
turn to a corresponding approximation involving the functions $2 r {x)- For 
large N and x > 2N, cf. ( 1231) . 

N N N 

r=l r=l r=l 

1 * 

(5.1) - — 4rL7 2 rloglogx + C(x 1/3 log 2 x) + o(N). 

r=l 

According to the Bateman-Horn conjecture [SJ, [3], applied to the special 
case of prime pairs (p, p 2 ± 2r), there should be specific positive constants 
2C* r = eg + eg 21 such that 

(5.2) 9* 2r (x) = {2C; r + o(l)}x as x -> oo. 

Here there is no need to study the Bateman-Horn constants in detail; our 
only concern will be their mean value (apparently equal to one). 

Conjecture 5.1. For x — ► oo one /ias 

1 N 

(5.3) ^£^(s)M2 + °(AT 1/a )}*. 

r=l 

Combining this conjecture with (15.11) and Approximation I2TT1 one obtains 
the (conjectured) 

Approximation 5.2. For large N and x much larger than N, one has 

1 N p 

(M*) ~ 2C W = -I 4 + °( N ' 1/2 logx)}J2--{^ + o(l)}N 

r=l p P 

(5.4) 



1 

- {4 + 0(iV~ 1/2 logx)}x 1/2 - — 4rC 2r logloga; + C(a; 1/3 log 

r=l 

To go from here to Approximation 11.11 we integrate by parts: 

N N 



r=l r=l ^ z to 



,, ^ d92r(t). 
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In the evaluation it is assumed that contributions due to derivatives of the 
C-terms can be neglected. The sum on the left of (15.41) then becomes the 
sum on the left of (jl.5p . Ignoring the log log x-term for a moment, the right- 
hand side of (15. 4p then gives the right-hand side of (II. 5p . The log log x-term 
(with its minus sign) ultimately leads to a contribution 

,, , N 



(5.5) - i- ArC 2r [ t\~ + d log log t ^ , 

N ^ J2r log t 



log z 2N 



To assess its effect on A N (x) in (II. 7p . one still has to divide by li 2 (x 1 / 2 ) ~ 
4x 1//2 / log 2 x. The result An(x) in ( 11.81) will be small when x is much larger 
than N 2 . 

In support of Conjecture 15.11 we proceed with a conjecture involving re- 
lated Dirichlet series 

(5.6) D* r (s) = Yl = r x ~* Sd6 2r(x) (o- > 1/4). 

2 ,„ P Jl 

p, p z ±2r prime 

Conjecture 5.3. For a > 1/4 and N — > oo, one has 

r=l 

an analytic function H^s) that has good boundary behavior as o \ 

1/4. 

The arguments supporting Conjecture 15.31 are similar to those given for 
Conjecture 12.21 For l/4<cr<l/2 one may write 



$t 2 (s) = ZW + 2 £ E(2r/\)DUs) 

0<2r<A 



(5-8) = + ^ J(s, s) + J2 fiiPm - *}J(p/2, *) + Hfo), 

where Dq(s) = J2 P ( lo § 2 P) /P 4s and H ^(s) is analytic for 1/4 < o < 1/2. 
The functions J(s,s) and J(p/2,s) are analytic for 1/4 < a < 1/2; cf. 
(18. 4p in the Appendix. Hence, formally the poles at the points s = p/2 in 
the combination of J-terms in (15. 8ft will cancel each other. However, one 
constituent of J(p/2,s) is an infinite series. It leads to a repeated series 
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£22(3) when it is substituted into the sum over p in (15. 8|) : 

s 2, 2 (^) = ^E r {^/ 2 )- s }E r ^- s ) 

p p' 

x M x {p + (p/2) - 2s} cos{7r(p' - p/2)/2}. 

This series is absolutely convergent only for 3/8 < a < 1/2; for 1/4 < a < 
3/8 the sum over p = (1/2) + ry has to be interpreted as a limit of partial 

sums J2\~f\<B as B ~* 00 • 

In view of the similarity of the Hardy-Littlewood conjecture and our 
case of the Bateman-Horn conjecture, it is reasonable to suppose that the 
differences 

9C* 

Gt(s) = DUs) - JJZT 

have an analytic continuation to the half-plane {a > 1/4}. If that is correct, 
the combination of the J-terms in (I5.8P truly has no poles at the points 
s = p/2. The repeated sum E2 2 (s) then would have an analytic continuation 
to the strip 1/4 < < 1/2, except for a pole at s = 1/4. The quadratic 
pole l/(4s — l) 2 of Dq(s) at s = 1/4 would be cancelled by the quadratic 
part of the pole of £2 2 ( s ) there. Finally, the residue of the pole of S^ 5 ) 
at s = 1/4 would be (9(A (1/4)+£ ) by heuristics as in Section HI Hence by 
(15.8p . the residue at s = 1/4 of 

1 N 1 
-J2 E(r/N)D* r (s) would be -A E + o(N-^ 2 ), 

r=l 

thus leading to ( 15. 7ft when E(y) is taken close to 1 on (—1, 1). 

The proof of (15.81) is similar to that of (13. ip described in the Appen- 
dix. Here, one would start with the integral obtained from (18.21) through 
replacement of one of the quotients C'(')/C(') by C'(2 -)/C(2 •)■ 
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PART II. NUMERICAL RESULTS AND GRAPHS 

6. Comparing averages of functions cu 2r (x) with u(x) 

Which of the two terms on the right-hand side of (11.51) . in the conjectured 
Approximation 11.11 is larger? One may write 

> ph 2 (x p ) = — ^> + 0\ — =— 

^ H V ; log 2 ^^ p \\og 3 x 

p ° p x o 

1/2 / 1/2 

lh( x L/*) = 4JL— + o( 



log 2 x V log 3 x 
Hence if we set 



,!,.]• xP \ cos(7loga;) + 27sin(7loga;) 



, 6 ,) tw*-z~y: 7 , +1/4 

p y 7>0 ' 1 

relation ( 11. 5ft takes the form 

n f v def Er=l ^2r(» 

(6.2) = -[{1 + 0(iV- 1 / 2 logx)}T(x) + l + C(A^- 1/2 logx)]. 

It is interesting that Riemann's formula ( 11.91) leads to a combination 
similar to the right-hand side of (16.21) . Indeed, assuming RH one may write 

(6.3) = - [{1 + 0(1/ \ogx)}T(x) + 1 + 0(1/ logs)] . 

Littlewood's work [18], cf. Ingham [13], implies that the function T(x) os- 
cillates unboundedly. More precisely, he showed that there are constants 
c, d > and arbitrarily large x, x' such that 

T(x) < — c log log log x, T(x) > d log log log x' . 

However, n(x) becomes larger than li(a;), that is, u(x) > 0, only for certain 
very large x. The first such number is associated with the name of Skewes; 
cf. te Riele [22], and Bays and Hudson [4J. Under RH one has T(x) = 
0(\og 2 x) and Kotnik [17] made it plausible that T(x) = 0(\ogx). He 
also graphed the function cu(x)(\ogx)/x 1 ^ 2 , cf. ( 16.31) . for x < 10 14 . On a 
logarithmic scale, his Figure 1 shows rapid oscillations of amplitude greater 
than 1/2. 

The Skewes story seems to have no analog for prime pairs, cf. Brent [6]. 
Here we focus on the case of twin primes. Nicely [21] has counted prime 
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twins up to x = 10 16 . His table uses steps 10 fc from 1 • 10 fc through 9 • 10 k 
for k — 1, 2, • • • , 12. From there on the steps are 10 12 . Nicely's table shows 
that for x going to 10 16 , the quantity |k>2(x)| often becomes a good deal 
larger than h^x 1 / 2 ). His table implies 16 sign changes of ^(x) [which is 
minus his entry (^(x)]. The first occurs between 10 6 and 2 • 10 6 , the last 
between 7.5 • 10 13 and 7.6 • 10 13 . Although ^(x) oscillates, it then remains 
positive until the end of Nicely's table. 

In a preprint on a 'Skewes number for twin primes', Marek Wolf [2U] 
analyzed the sign changes in u>2(x) up to 2 42 « 4.4 • 10 12 . He found the 
first one at the twin with p = 1369391. A table in his preprint lists the 
number of sign changes up to 2 fc for k = 22, 23, • • ■ ,42. Wolf found 90355 
sign changes up to 2 42 . He found none between 2 22 and 2 25 , none between 
2 28 and 2 31 , and none between 2 37 and 2 39 . 

In our range of x, the values of |T(x)| are smaller than one. In particular 

T(10 6 ) « 0.41156, T(10 8 ) « 0.17554, 

(6.4) T(10 10 ) « -0.42122, T(10 12 ) « -0.04014. 

These values were computed with the aid of von Mangoldt's formula (11.101) . 
by which (for x > 1 and x not a prime power) 

(6.5) T(x) = x- l/2 {x - ${x) - log(27r) - (1/2) log(l - aT 2 )}. 

The function if)(x) = J2 p ™< x ^°&P was computed by summing the values 
of |_l°g p x\ logp for all the primes p < x (generated with the sieve of Eratos- 
thenes). Here, |_log p xJ is the exponent of p in the highest power of p not 
exceeding x. The values of T(x) given in ( 16 Ah were computed with an ac- 
curacy of at least 5 decimal digits. We were using Fortran double precision 
floating point arithmetic which works with an accuracy of about 15 decimal 
digits, but precision is lost as x grows when ( 16. 5ft is used to compute T(x). 
To illustrate this, we found that ^(10 12 ) = 1000000040136.76, so that in 
the difference 10 12 — ^(lO 12 ) = —40136.76 only about seven digits are still 
correct and T(10 12 ) = -0.04013860. 

Alternative computations based on formula (16. ip and the first two million 
values of 7 gave the values T(10 6 ) w 0.41276, T(10 8 ) w 0.17469, T(10 10 ) w 
-0.41944, and T(10 12 ) w -0.04010, i.e., an accuracy of only about 3 decimal 
digits. 
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7. Testing the conjectured Approximation 11.11 
In the following we will consider the aggregate 

(7.1) ^(x) = f TT 2 (x) + 7T 4 (x) H \~TC 2N (x) 

for certain large values of N and x. Setting 

(7.2) Sjv = C*2 + C4 + ■ ■ • + C2N1 

cf. (13.41) . we compare 11^- (rr) with 25Vli2(^) = {Sn/C 2 )L 2 (x). In view of 
the conjectured Approximation [LT], the difference is divided by N\i 2 (x 1 ^ 2 ) 
to obtain the quotient 

/ 7 oX njy(x) - (S N /C 2 )L 2 (x) _ J2r=l ^rjx) _ ( . 

iVli 2 (xV2) " iVli 2 ( x i/2) 

cf. (16.21) . For large iV the quotient should have the form 

-{1 + 0{N~ 1 ' 2 logx)}T(x) - {1 + 0(N~ 1 ' 2 logs)} 

Ignoring the O-terms, we will compare Qn{%) with —T(x) — 1, setting 

(7.4) Q N {x)+T{x) + l = A N {x). 

Tables 2, 3 give results for x = 10 6 , 10 8 , 10 10 , 10 12 . The values S N /C 2 
were obtained by computing C 2r /C 2 from f I X . 3 p and adding. For the values 
of Hn(%) we added columns of numbers vr 2r (x). We next computed Qn(x) 
from ( 17. 3p . Here we used the approximations 

L 2 (10 6 ) « 8248.0297, L 2 (10 8 ) « 440367.7942, 
L 2 (10 10 ) « 27411416.53, L 2 (10 12 ) « 1870559866.82 

and 

li 2 (10 3 ) « 34.6851, li 2 (10 4 ) » 162.2412, 
li 2 (10 5 ) » 945.75959, li 2 (10 6 ) « 6246.9757. 

The table entries An{x) are based on ( 17.4p and the approximations for T(x) 
in (6.4). 

In Figures 1-4 we show plots of Ajv( function of N (50 < 2N < 

5000), for x = 10 6 , 10 8 , 10 10 , and 10 12 . We have omitted the function 
values for 2 < 2N < 48 since they very much dominate (and are atypical 
for) the other function values. In Figures 1 and 2 we compare A^(x) with 

the function A^(x) = — §^1/2^^ as defined in ( 11.81) . 
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2N 


Sn/ C*2 


Iijv(10 6 ) 


Ajv(10 6 ) 


ILv(10 8 ) 


Ajv(10 8 ) 


100 


73.6377551 


605087 


+0.09722 


32417440 


-0.08872 


200 


149.3252708 


1226667 


-0.02199 


65739481 


+0.03162 


300 


225.4407734 


1851433 


-0.12785 


99245855 


-0.09833 


400 


300.3132204 


2465581 


-0.23344 


132202659 


-0.23013 


500 


376.0636735 


3086695 


-0.32860 


165551273 


-0.18188 


600 


452.4693143 


3714028 


-0.31371 


199186203 


-0.19507 


700 


527.3827110 


4328507 


-0.34805 


232164862 


-0.18926 


800 


603.4536365 


4951873 


-0.42140 


265651152 


-0.21737 


900 


679.4011178 


5574196 


-0.48004 


299079601 


-0.28690 


1000 


754.4223630 


6188960 


-0.52230 


332105577 


-0.27582 


2000 


1511.5853400 


12391586 


-0.78001 


665435604 


-0.16751 


3000 


2269.6853566 


18597363 


-0.95390 


999175096 


-0.14446 


4000 


3026.0445409 


24783891 


-1.11135 


1332114654 


-0.23565 


5000 


3783.8474197 


30975067 


-1.28953 


1665693721 


-0.28111 



Table 2. Values of S N /C 2 , IM10 6 ), A 7V (10 6 ) and LTv(10 8 ), A 7V (10 8 ) 



2N 


IM10 10 ) 


A,v(10 10 ) 


iMio 12 ) 


A iV (10 12 ) 


100 


2018498733 


+0.23101 


137743459486 


-0.22449 


200 


4093181354 


+0.19981 


279320931774 


-0.52374 


300 


6179575427 


+0.04646 


421698995095 


-0.60678 


400 


8231900717 


-0.00307 


561752066806 


-0.47345 


500 


10308323520 


+0.09461 


703447298670 


-0.52336 


600 


12402663153 


+0.00891 


846368266787 


-0.46665 


700 


14456137134 


+0.06512 


986498011024 


-0.37686 


800 


16541312091 


+0.03187 


1128792535379 


-0.48827 


900 


18623097684 


-0.00710 


1270856645797 


-0.39850 


1000 


20679532323 


+0.04311 


1411187901897 


-0.41454 


2000 


41434008965 


-0.14700 


2827502930522 


-0.31142 


3000 


62214267139 


-0.14273 


4245571295213 


-0.21865 


4000 


82946817735 


-0.13473 


5660383932743 


-0.12392 


5000 


103718886923 


-0.15324 


7077896171945 


-0.12569 



Table 3. Values of LTv(10 10 ), A^IO 10 ) and LTv(10 12 ), A 7V (10 12 ) 
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We have made, but not given here, plots of A^{x) for several other values 
of x. E.g., for x = 10 11 and 2N = 1000,2000,3000,4000,5000, we found: 
A N {x) = -0.229,-0.072, -0.034, +0.004, and -0.034, respectively (com- 
pare these values with the corresponding values for x = 10 10 and x = 10 12 
in Table 3). 

Figures 5 and 6 show plots of An(x) as a function of x (6 < log 10 x < 12), 
for N = 400 and N = 2500, respectively. The plots have been constructed 
by connecting the values of An(x) for x = 10 6 and for x = i x 10 3 ', j = 
6, 7, . . . , 11 and i — 1, 2, . . . , 10 by straight lines. The different behaviour of 
the plots of A 400 (a;) and A 2 5oo(x) may reflect the influence of the unknown 
0(N~ 1 / 2 logx)-terms, which were neglected in the derivation of the error 
function A^(x) from (11.51) . 
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Figure 5. A 400 (a;) for 6 < log 10 a; < 12 
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Figure 6. A 250 o(X) for 6 < log 10 x < 12 
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APPENDIX 

8. Outline of the proof of Theorem 13.11 

Extending an idea that goes back to Arenstorf pQ, cf. [16], one is led to 
a representation for E{(a — /3)/\} by an absolutely convergent repeated 
complex integral in which a > and (5 > occur separately: 



E{(a-P)/\} = t^t^ / T{z)a- Z dz I T(w)P~ 

' " ' ' 'L(c,B) JL(c,B) 



[2m 



L(c,B) 



(8.1) x M x (z + w)cos{tc(z - w)/2}dw. 

Here the path L(c, B) = L(ci, C2, B) in the z = x + iy plane is taken of the 
form 

the half-line {x = Ci, — oo<y<—B} 

+ the segment {c\ < x < C2, y = — B} 

+ the segment {x = C2, —B < y < B} 

+ the segment {c2 > x > C\, y — B} 

+ the half-line {x = c\, B < y < oo}, 

and similarly for the w = u + iv plane. For — 1/2 < c\ < < c 2 < 1/2, say, 
and arbitrary B > 0, the absolute convergence of the repeated integral in 
(18. ip follows from i\3.6L (13. 7h and Lemma [3731 

For the verification of formula (18. 11) one may write cos at as a complex 
(inverse) Mellin integral involving T(z), and cos fit as such an integral in- 
volving r(w). Multiplying the two, doing the same with sines and adding, 
one obtains a repeated complex integral for cos(a — f3)t. It is multiplied by 
E x (t); integration over < t < oo and use of (13.51) then gives the result. 

Formula (18.1 p leads to the following integral for $ A (s) in ( 13.11) . modulo a 
function H x (s) that turns out to be analytic for a > 0: 

^(s) = ^ I T(z) C 'S Z + S ) dz ! FM C ' (W + S) 



(2m) 2 J L{c B) ((z + s) J L[c B) ((w + s) 

(8.2) x M x (z + w) cos{tt(z - w)/2}dw + H x (s). 

For verification one introduces the Dirichlet series ^2A(k)k"' and A(Z)Z"" 
for the quotients C'/C- O ne then integrates term by term, initially taking 
a > 1 + |ci|. The result 

fcj=l 
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differs from $ A (s) in (13.11) by 

r 1 i ^ 

>E(2r/X) 



f 1 

H\s) = 2 y £A(n)A(n + 2r) \-Ts 

0<2r<A n=l l - 



n s {n + 2r) f 



2 y y " W / H 1 ^ {(2r-l)/A}. 

0<2r-KA n=l v ' 



A(n)A(n + 2r - 1) 

~ z 

0<2r-l<A n=l 

Thus H x (s) is analytic for a > 0. One may verify that it is the Mellin 
transform of a function h x (x) which is 0{X(1 + X 2 /x) log 2 x} for x > X. 

Analytic continuation shows that under RH, one may take paths L(c, B) 
in ( 18.21) with C\ = —rj and = (1/2) — rj, where < i] < 1/2. Thus the 
integral representation may be used for s = a + ir with a > (1/2) + rj and 
\r\ < B; cf. [16J. Additionally requiring a < 1, we now move the paths 
L(c, B) across the poles at the points 1 — s, and p — s to lines L(d), given 
by x or u equal to d = —(1/2) The moves may be justified by Cauchy's 
theorem and the estimates in Lemma 13.31 On the relevant vertical lines, 
(CV C) (Z) only grows logarithmically in Y , and auxiliary horizontal segments 
can be suitably chosen between zeta's complex zeros. 

First moving the w-path one obtains a new repeated integral, along with 
a single 'residue-integral'. It is convenient to write the latter in the form 

(8.3) -L/ r(z)^J(z + V )«fe, 

where by the residue theorem 

J(z + s,s) = -r(l - s)M x (z + l-s) cos{tt(z + s- l)/2} 

(8.4) + ^M A (^)cos(^/2) 

+ y T(p - s)M x (z + p- s) cos{tt(z + s- p)/2}. 
p 

Next move the z-path L(c, B) in the new repeated integral and the z-path 
in the single integral to the line L(d). Thus we obtain another repeated 
integral, now involving two paths L(d), and a single integral with path 
L(d), where d = —(1/2) + rj. Varying rj G (0, 1/2), one sees that the new 
integrals represent analytic functions for —1/2 < a < 1. The operation on 
the repeated integral produces a harmless residue, namely, another copy of 
the single integral with path L(d). However, the operation on the single 
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integral yields the following residue: 

(8.5) - r(i - s) J(l, s) + ^ J(s, s) + J2 r (p - s)J(j/, s). 

Working out this residue with the aid of ( 18 Ah one obtains nine terms. Four 
of these supply the sum S A (s) of (13.81) in (13.11) . The remaining five terms 
combine into the sum 

V\s) = T 2 (l - s)M\2 - 2s) - 2r(l - s) M\l - s) sin(vrs/2) 

(8.6) - 2r(l -s)^2 T(p - s)M A (l + p - 2s) sin(7rp/2). 

p 

Here, the apparent poles at the points s = p cancel each other. The first 
term provides the important pole-term at the point s = 1/2 in ( 13.11) . Indeed, 
by the pole-type behavior of M X (Z) at the point Z = 1 (Section [2]), 

?A E \ 

(8.7) T 2 (l - s)M\2 - 2s) = ^-^ + Hfo), 

where H^(s) is analytic for — 1/2 < a < 1. The final term in (18.61) generates 
simple poles at the points s = p/2. A short computation shows that the 
residues at those poles are all equal to — 2A E X, thus leading to the term 
-4A E XY ip l/(2s-p)m^. 

To round out the proof of Theorem 13.11 we evaluate the inverse Mellin 
transform of V x (s) in (18.61) . Taking c = (1/2) + rj and x > A, and moving 
the path to the left, one finds that 

— / V x (s) x 2s — = 2A E \ x - AA E \ Y — + g{x, A). 
2m J L(c) s ^ p 

Here for large A and xj A — ► oo, 

g (x,X) = -{1 + o(l)}A 2 I E{v)vdv^ -A 2 /2 

Jo 

when E{y) is close to 1 on (—1, 1). 

By our hypothesis the double sum S A (s) in (13. 8p generates a pole at 
s = 1/2 with residue R E (1/2,X) given by (13. 3p . Through Mellin inversion 
this becomes 2R E (l/2, \)x; when added to 2A E \x, it gives the principal 
part 

2 E(2r/X)-2C 2r x of 2 ^ E(2r/ \)ip 2r (x). 

0<2r<\ 0<2r<\ 
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Other plausible contributions of E^s) have been discussed in Section HI 
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